#Chinese AI

6 articles

TechJun 28, 20268 min

DeepSeek-V4-Pro-DSpark is not a new model but a speculative-decoding V4-Pro

DeepSeek-V4-Pro-DSpark isn't a new base model. It's the same 1.6T V4-Pro checkpoint plus a DSpark speculative-decoding head (~893GB). What config.json and the DeepSpec repo reveal, and why there's no speed benchmark yet.

LLM DeepSeek Chinese AI MoE Inference Optimization Open Model Speculative Decoding

TechJun 16, 202612 min

Claude Fable 5 suspended: Kimi K2.7 Code & Qwen3.7 Max as Claude Code backends

After a US order pulled Claude Fable 5, which Chinese models drop into Claude Code? Kimi K2.7 Code, Qwen3.7 Max, DeepSeek V4 and GLM-5.1 — constraints, VRAM, benchmark caveats.

AI LLM Chinese AI Kimi Qwen DeepSeek MoE AI Agents

TechApr 24, 2026updated11 min

DeepSeek V4 Preview specs: V4-Pro 1.6T and V4-Flash 284B open under MIT, 1M context, 27% inference FLOPs of V3.2

DeepSeek V4 Preview ships V4-Pro (1.6T/49B active) and V4-Flash (284B/13B active) as open weights under MIT, both with 1M context. CSA+HCA hybrid attention, mHC, and the Muon optimizer cut per-token FLOPs at 1M tokens to 27% of V3.2. Day-one API and chat.deepseek.com mode switch covered.

LLM DeepSeek Chinese AI MoE Open Model AI Agent

TechApr 24, 2026updated14 min

Tencent Hy3-preview (295B) vs Ant Ling-2.6-flash (104B): two open Chinese MoEs released the same week

Two open-weight Chinese MoEs landed within 24 hours: Ant Ling-2.6-flash (104B/7.4B active, 7x token-efficiency claim) and Tencent Hy3-preview (295B/21B active, frontier-tier open weights). Specs, licenses, and how they line up against DeepSeek-V3 and GLM-4.5.

LLM Chinese AI MoE Open Model AI Agent Local LLM OpenRouter

TechApr 23, 2026updated9 min

Xiaomi ships MiMo-V2.5 and MiMo-V2.5-Pro together — 1M omnimodal and 1,000-tool agent, API only

Xiaomi launched two MiMo-V2.5 models at once. MiMo-V2.5-Pro hits SWE-bench Pro 57.2, Claw-Eval 63.8, and τ3-Bench 72.9 — frontier-tier — while MiMo-V2.5 brings native omnimodality plus a 1M context. Both are API-only for now; open weights are promised but unscheduled.

AI LLM Chinese AI MoE AI Agents Multimodal Xiaomi

TechApr 8, 2026updated8 min

GLM-5.1 (Zhipu, 744B / 40B MoE, MIT): 58.4% SOTA on SWE-Bench Pro, 8h / 6,000+ tool calls without degradation

Zhipu AI's GLM-5.1 is a 744B MoE (40B active, 200K context, MIT) targeting long-horizon agent tasks. Hits 58.4% SOTA on SWE-Bench Pro (edging out GPT-5.4 and Claude Opus 4.6) and sustains performance across 8-hour sessions with 6,000+ tool calls without degradation.

AI LLM Chinese AI MoE Open Model AI Agent